Overview

Dataset statistics

Number of variables: 53
Number of observations: 180519
Missing cells: 336209
Missing cells (%): 3.5%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 333.7 MiB
Average record size in memory: 1.9 KiB
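
The percentage and per-record figures above can be recomputed from the raw counts. The following is a minimal sketch in plain Python. It assumes that the missing-cell percentage is the number of missing cells divided by variables times observations, and that the average record size is total memory divided by the number of observations; both formulas are inferred from the numbers, not taken from the tool's source.

```python
# Figures copied from the dataset statistics above.
n_variables = 53
n_observations = 180519
missing_cells = 336209
total_size_mib = 333.7

# Assumed formula: share of missing cells among all cells.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumed formula: average record size in KiB (1 MiB = 1024 KiB).
avg_record_kib = total_size_mib * 1024 / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size: {avg_record_kib:.1f} KiB")
```

Both results round to the reported values, which suggests the assumed formulas are at least consistent with the report.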

Variable types

NUM: 25
CAT: 24
BOOL: 2
URL: 1
UNSUPPORTED: 1

Reproduction

Analysis started: 2020-04-08 07:35:13.153937
Analysis finished: 2020-04-08 07:54:11.098164
Version: pandas-profiling v2.5.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml
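
The two timestamps above give the wall-clock duration of the profiling run. A small sketch using only the standard library, with the timestamps copied from this block:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2020-04-08 07:35:13.153937", FMT)
finished = datetime.strptime("2020-04-08 07:54:11.098164", FMT)

# Subtracting two datetimes yields a timedelta.
elapsed = finished - started
print(elapsed)
```

The run took just under 19 minutes for 180519 rows and 53 variables.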

Warnings

Customer City has a high cardinality: 563 distinct values [High cardinality]
Customer Fname has a high cardinality: 782 distinct values [High cardinality]
Customer Lname has a high cardinality: 1109 distinct values [High cardinality]
Customer Street has a high cardinality: 7458 distinct values [High cardinality]
Order City has a high cardinality: 3597 distinct values [High cardinality]
Order Country has a high cardinality: 164 distinct values [High cardinality]
order date (DateOrders) has a high cardinality: 65752 distinct values [High cardinality]
Order State has a high cardinality: 1089 distinct values [High cardinality]
Product Name has a high cardinality: 118 distinct values [High cardinality]
shipping date (DateOrders) has a high cardinality: 63701 distinct values [High cardinality]
Longitude is highly correlated with Customer Zipcode [High Correlation]
Customer Zipcode is highly correlated with Longitude [High Correlation]
Order Customer Id is highly correlated with Customer Id [High Correlation]
Customer Id is highly correlated with Order Customer Id [High Correlation]
Order Item Cardprod Id is highly correlated with Category Id and 3 other fields [High Correlation]
Category Id is highly correlated with Order Item Cardprod Id and 2 other fields [High Correlation]
Department Id is highly correlated with Order Item Cardprod Id and 1 other field [High Correlation]
Order Item Id is highly correlated with Order Id [High Correlation]
Order Id is highly correlated with Order Item Id [High Correlation]
Sales is highly correlated with Sales per customer and 1 other field [High Correlation]
Sales per customer is highly correlated with Sales and 1 other field [High Correlation]
Order Item Total is highly correlated with Sales per customer and 1 other field [High Correlation]
Order Profit Per Order is highly correlated with Benefit per order [High Correlation]
Benefit per order is highly correlated with Order Profit Per Order [High Correlation]
Product Card Id is highly correlated with Category Id and 3 other fields [High Correlation]
Product Category Id is highly correlated with Category Id and 2 other fields [High Correlation]
Product Price is highly correlated with Order Item Product Price [High Correlation]
Order Item Product Price is highly correlated with Product Price [High Correlation]
Customer State is highly correlated with Customer Country [High Correlation]
Customer Country is highly correlated with Customer State [High Correlation]
Department Name is highly correlated with Category Name [High Correlation]
Category Name is highly correlated with Department Name [High Correlation]
Order Region is highly correlated with Market [High Correlation]
Market is highly correlated with Order Region [High Correlation]
Order Status is highly correlated with Type [High Correlation]
Type is highly correlated with Order Status [High Correlation]
Shipping Mode is highly correlated with Days for shipment (scheduled) [High Correlation]
Days for shipment (scheduled) is highly correlated with Shipping Mode [High Correlation]
Order Zipcode has 155679 (86.2%) missing values [Missing]
Product Description has 180519 (100.0%) missing values [Missing]
Product Description is an unsupported type, check if it needs cleaning or further analysis [Rejected]
Days for shipping (real) has 5080 (2.8%) zeros [Zeros]
Order Item Discount has 10028 (5.6%) zeros [Zeros]
Order Item Discount Rate has 10028 (5.6%) zeros [Zeros]

Variables

Type
Categorical

HIGH CORRELATION
Distinct count: 4
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
DEBIT | 69295 | 38.4%
TRANSFER | 49883 | 27.6%
PAYMENT | 41725 | 23.1%
CASH | 19616 | 10.9%

Length

Max length: 8
Mean length: 6.182606817
Min length: 4
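
The length statistics can be reproduced from the value counts of this variable. A minimal sketch, assuming the mean length is the average string length over all rows (each value weighted by how often it occurs):

```python
# Value counts for "Type", copied from this variable's frequency table.
value_counts = {"DEBIT": 69295, "TRANSFER": 49883, "PAYMENT": 41725, "CASH": 19616}

total = sum(value_counts.values())
lengths = [len(value) for value in value_counts]

# Weighted mean of the string lengths, weighted by the count of each value.
mean_length = sum(len(v) * n for v, n in value_counts.items()) / total

print(max(lengths), min(lengths), round(mean_length, 9))
```

The weighted mean agrees with the reported mean length, while an unweighted mean over the four distinct values (6.0) would not.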

Unicode category | Count | Frequency (%)
Uppercase_Letter | 15 | 100.0%

Unicode script | Count | Frequency (%)
Latin | 15 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 15 | 100.0%

Days for shipping (real)
Real number (ℝ≥0)

ZEROS
Distinct count: 7
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.497653987
Minimum: 0
Maximum: 6
Zeros: 5080
Zeros (%): 2.8%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 0
5th percentile: 1
Q1: 2
Median: 3
Q3: 5
95th percentile: 6
Maximum: 6
Range: 6
Interquartile range (IQR): 3
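
For a variable with only seven distinct values, the quantiles can be read off the cumulative value counts. The sketch below uses a simple "first value whose cumulative share reaches q" rule. That rule is an assumption for illustration; the profiling tool may interpolate between observations, which gives the same answers here because neighbouring observations are equal at every cut point.

```python
# Value counts for "Days for shipping (real)", copied from this report.
counts = {0: 5080, 1: 4657, 2: 56618, 3: 28765, 4: 28513, 5: 28163, 6: 28723}

def quantile_from_counts(counts, q):
    """Smallest value whose cumulative share of observations reaches q."""
    total = sum(counts.values())
    cumulative = 0
    for value in sorted(counts):
        cumulative += counts[value]
        if cumulative >= q * total:
            return value
    raise ValueError("q must lie in [0, 1]")

q1 = quantile_from_counts(counts, 0.25)
median = quantile_from_counts(counts, 0.50)
q3 = quantile_from_counts(counts, 0.75)
print(q1, median, q3, q3 - q1)
```

The interquartile range is simply Q3 minus Q1, and the range is maximum minus minimum.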

Descriptive statistics

Standard deviation: 1.623721828
Coefficient of variation (CV): 0.464231692
Kurtosis: -1.007913583
Mean: 3.497653987
Mean absolute deviation (MAD): 1.423770874
Skewness: 0.084771273
Sum: 631393
Variance: 2.636472576
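
Several of the descriptive statistics above can be checked against the value counts of this variable. The sketch below recomputes the sum, mean, variance, coefficient of variation and the MAD figure. Two points are assumptions that happen to reproduce the reported numbers: the variance uses the sample form (dividing by n - 1), and the MAD figure is the mean absolute deviation around the mean rather than a median-based statistic.

```python
import math

# Value counts for "Days for shipping (real)", copied from this report.
counts = {0: 5080, 1: 4657, 2: 56618, 3: 28765, 4: 28513, 5: 28163, 6: 28723}

n = sum(counts.values())
total = sum(value * c for value, c in counts.items())
mean = total / n

# Sample variance (divides by n - 1), assumed because it matches the report.
variance = sum(c * (value - mean) ** 2 for value, c in counts.items()) / (n - 1)
std = math.sqrt(variance)

# Coefficient of variation: standard deviation relative to the mean.
cv = std / mean

# Mean absolute deviation around the mean.
mean_abs_dev = sum(c * abs(value - mean) for value, c in counts.items()) / n

print(total, round(mean, 9), round(variance, 9), round(cv, 9), round(mean_abs_dev, 9))
```

A median absolute deviation for this integer-valued column would itself be an integer or a half, so the reported 1.423770874 can only be a mean-based figure.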
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 2.5 5.5 6. ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
2 | 56618 | 31.4%
3 | 28765 | 15.9%
6 | 28723 | 15.9%
4 | 28513 | 15.8%
5 | 28163 | 15.6%
0 | 5080 | 2.8%
1 | 4657 | 2.6%

Minimum 5 values
Value | Count | Frequency (%)
0 | 5080 | 2.8%
1 | 4657 | 2.6%
2 | 56618 | 31.4%
3 | 28765 | 15.9%
4 | 28513 | 15.8%

Maximum 5 values
Value | Count | Frequency (%)
6 | 28723 | 15.9%
5 | 28163 | 15.6%
4 | 28513 | 15.8%
3 | 28765 | 15.9%
2 | 56618 | 31.4%

Days for shipment (scheduled)
Categorical

HIGH CORRELATION
Distinct count: 4
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
4 | 107752 | 59.7%
2 | 35216 | 19.5%
1 | 27814 | 15.4%
0 | 9737 | 5.4%

Length

Max length: 1
Mean length: 1
Min length: 1

Unicode category | Count | Frequency (%)
Decimal_Number | 4 | 100.0%

Unicode script | Count | Frequency (%)
Common | 4 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 4 | 100.0%

Benefit per order
Real number (ℝ)

HIGH CORRELATION
Distinct count: 21998
Unique (%): 12.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.97498864
Minimum: -4274.97998
Maximum: 911.7999878
Zeros: 1177
Zeros (%): 0.7%
Memory size: 1.4 MiB
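
Values such as 911.7999878 in this variable (and 143.9900055 or 46.79999924 among its frequent values) look like two-decimal amounts with trailing noise. One plausible explanation, offered here as an assumption and not something the report states, is that the amounts passed through 32-bit floating point at some stage. The sketch below rounds a few decimal amounts to 32-bit floats and prints them with ten significant digits:

```python
import struct

def as_float32(x):
    """Round a Python float to the nearest 32-bit float and back."""
    return struct.unpack("f", struct.pack("f", x))[0]

# Candidate "clean" amounts, chosen for illustration.
for decimal_value in (911.8, 143.99, 46.8):
    print(decimal_value, "->", format(as_float32(decimal_value), ".10g"))
```

If that explanation holds, rounding the column to two decimals before analysis would recover the intended amounts.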

Quantile statistics

Minimum: -4274.97998
5th percentile: -139.2509994
Q1: 7
Median: 31.52000046
Q3: 64.80000305
95th percentile: 132.2899933
Maximum: 911.7999878
Range: 5186.779968
Interquartile range (IQR): 57.80000305

Descriptive statistics

Standard deviation: 104.4335257
Coefficient of variation (CV): 4.752381331
Kurtosis: 71.37725866
Mean: 21.97498864
Mean absolute deviation (MAD): 56.08869441
Skewness: -4.74183407
Sum: 3966902.974
Variance: 10906.3613
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4274.97998 -1355.61499 -1083.169983 -824.94000245 -674.98498535 ... 239.8199997 240.19000245 245.32499695 720.9499817 911.7999878 ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
0 | 1177 | 0.7%
143.9900055 | 199 | 0.1%
72 | 194 | 0.1%
46.79999924 | 188 | 0.1%
24 | 181 | 0.1%
18 | 175 | 0.1%
63.70000076 | 172 | 0.1%
62.40000153 | 168 | 0.1%
14.39999962 | 166 | 0.1%
12 | 166 | 0.1%
Other values (21988) | 177733 | 98.5%

Minimum 5 values
Value | Count | Frequency (%)
-4274.97998 | 1 | < 0.1%
-3442.5 | 1 | < 0.1%
-3366 | 1 | < 0.1%
-3000 | 1 | < 0.1%
-2592 | 1 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
911.7999878 | 1 | < 0.1%
864 | 1 | < 0.1%
721.5999756 | 1 | < 0.1%
720.2999878 | 1 | < 0.1%
720 | 2 | < 0.1%

Sales per customer
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 2927
Unique (%): 1.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 183.1076085
Minimum: 7.489999771
Maximum: 1939.98999
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 7.489999771
5th percentile: 41.5
Q1: 104.3799973
Median: 163.9900055
Q3: 247.3999939
95th percentile: 383.980011
Maximum: 1939.98999
Range: 1932.49999
Interquartile range (IQR): 143.0199966

Descriptive statistics

Standard deviation: 120.04367
Coefficient of variation (CV): 0.6555908354
Kurtosis: 23.92036151
Mean: 183.1076085
Mean absolute deviation (MAD): 87.66090517
Skewness: 2.888446057
Sum: 33054402.38
Variance: 14410.48271
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 7.48999977 8.43000031 8.48000002 8.57499981 8.67499971 ... 1057.4949951 1215. 1492.494995 1549.994995 1939.98999 ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
122.8399963 | 1264 | 0.7%
109.1900024 | 1247 | 0.7%
114.3899994 | 1243 | 0.7%
113.0899963 | 1243 | 0.7%
129.9900055 | 1243 | 0.7%
126.0899963 | 1243 | 0.7%
107.8899994 | 1243 | 0.7%
123.4899979 | 1243 | 0.7%
97.48999786 | 1243 | 0.7%
110.4899979 | 1243 | 0.7%
Other values (2917) | 168064 | 93.1%

Minimum 5 values
Value | Count | Frequency (%)
7.489999771 | 3 | < 0.1%
7.989999771 | 3 | < 0.1%
8.18999958 | 3 | < 0.1%
8.289999962 | 3 | < 0.1%
8.390000343 | 3 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
1939.98999 | 1 | < 0.1%
1919.98999 | 1 | < 0.1%
1899.98999 | 1 | < 0.1%
1889.98999 | 1 | < 0.1%
1859.98999 | 1 | < 0.1%

Delivery Status
Categorical

Distinct count: 4
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Late delivery | 98977 | 54.8%
Advance shipping | 41592 | 23.0%
Shipping on time | 32196 | 17.8%
Shipping canceled | 7754 | 4.3%

Length

Max length: 17
Mean length: 14.39807998
Min length: 13

Unicode category | Count | Frequency (%)
Lowercase_Letter | 17 | 81.0%
Uppercase_Letter | 3 | 14.3%
Space_Separator | 1 | 4.8%

Unicode script | Count | Frequency (%)
Latin | 20 | 95.2%
Common | 1 | 4.8%

Unicode block | Count | Frequency (%)
ASCII | 21 | 100.0%

Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
1 | 98977 | 54.8%
0 | 81542 | 45.2%

Category Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 51
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.85145054
Minimum: 2
Maximum: 76
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 2
5th percentile: 9
Q1: 18
Median: 29
Q3: 45
95th percentile: 48
Maximum: 76
Range: 74
Interquartile range (IQR): 27

Descriptive statistics

Standard deviation: 15.64006388
Coefficient of variation (CV): 0.4910314481
Kurtosis: -0.6032610083
Mean: 31.85145054
Mean absolute deviation (MAD): 13.94913209
Skewness: 0.3616247994
Sum: 5749792
Variance: 244.6115983
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 2.5 3.5 4.5 6.5 ... 72.5 73.5 74.5 75.5 76. ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
17 | 24551 | 13.6%
18 | 22246 | 12.3%
24 | 21035 | 11.7%
46 | 19298 | 10.7%
45 | 17325 | 9.6%
48 | 15540 | 8.6%
43 | 13729 | 7.6%
9 | 12487 | 6.9%
29 | 10984 | 6.1%
37 | 2029 | 1.1%
Other values (41) | 21295 | 11.8%

Minimum 5 values
Value | Count | Frequency (%)
2 | 138 | 0.1%
3 | 632 | 0.4%
4 | 67 | < 0.1%
5 | 343 | 0.2%
6 | 328 | 0.2%

Maximum 5 values
Value | Count | Frequency (%)
76 | 650 | 0.4%
75 | 838 | 0.5%
74 | 529 | 0.3%
73 | 357 | 0.2%
72 | 492 | 0.3%

Category Name
Categorical

HIGH CORRELATION
Distinct count: 50
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Cleats | 24551 | 13.6%
Men's Footwear | 22246 | 12.3%
Women's Apparel | 21035 | 11.7%
Indoor/Outdoor Games | 19298 | 10.7%
Fishing | 17325 | 9.6%
Water Sports | 15540 | 8.6%
Camping & Hiking | 13729 | 7.6%
Cardio Equipment | 12487 | 6.9%
Shop By Sport | 10984 | 6.1%
Electronics | 3156 | 1.7%
Other values (40) | 20168 | 11.2%

Length

Max length: 20
Mean length: 12.70779253
Min length: 4

Unicode category | Count | Frequency (%)
Lowercase_Letter | 24 | 49.0%
Uppercase_Letter | 19 | 38.8%
Other_Punctuation | 4 | 8.2%
Dash_Punctuation | 1 | 2.0%
Space_Separator | 1 | 2.0%

Unicode script | Count | Frequency (%)
Latin | 43 | 87.8%
Common | 6 | 12.2%

Unicode block | Count | Frequency (%)
ASCII | 49 | 100.0%

Customer City
Categorical

HIGH CARDINALITY
Distinct count: 563
Unique (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Caguas | 66770 | 37.0%
Chicago | 3885 | 2.2%
Los Angeles | 3417 | 1.9%
Brooklyn | 3412 | 1.9%
New York | 1816 | 1.0%
Philadelphia | 1577 | 0.9%
Bronx | 1500 | 0.8%
San Diego | 1437 | 0.8%
Miami | 1314 | 0.7%
Houston | 1297 | 0.7%
Other values (553) | 94094 | 52.1%

Length

Max length: 20
Mean length: 7.708623469
Min length: 2

Unicode category | Count | Frequency (%)
Lowercase_Letter | 26 | 50.0%
Uppercase_Letter | 25 | 48.1%
Space_Separator | 1 | 1.9%

Unicode script | Count | Frequency (%)
Latin | 51 | 98.1%
Common | 1 | 1.9%

Unicode block | Count | Frequency (%)
ASCII | 52 | 100.0%

Customer Country
Categorical

HIGH CORRELATION
Distinct count: 2
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
EE. UU. | 111146 | 61.6%
Puerto Rico | 69373 | 38.4%

Length

Max length: 11
Mean length: 8.537189991
Min length: 7

Unicode category | Count | Frequency (%)
Lowercase_Letter | 7 | 53.8%
Uppercase_Letter | 4 | 30.8%
Space_Separator | 1 | 7.7%
Other_Punctuation | 1 | 7.7%

Unicode script | Count | Frequency (%)
Latin | 11 | 84.6%
Common | 2 | 15.4%

Unicode block | Count | Frequency (%)
ASCII | 13 | 100.0%

Customer Email
Categorical

CONSTANT
REJECTED
Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
XXXXXXXXX | 180519 | 100.0%

Length

Max length: 9
Mean length: 9
Min length: 9

Unicode category | Count | Frequency (%)
Uppercase_Letter | 1 | 100.0%

Unicode script | Count | Frequency (%)
Latin | 1 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 1 | 100.0%

Customer Fname
Categorical

HIGH CARDINALITY
Distinct count: 782
Unique (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Mary | 65150 | 36.1%
James | 1835 | 1.0%
Robert | 1759 | 1.0%
Michael | 1680 | 0.9%
David | 1625 | 0.9%
John | 1446 | 0.8%
William | 1365 | 0.8%
Joseph | 1117 | 0.6%
Jennifer | 1033 | 0.6%
Richard | 1032 | 0.6%
Other values (772) | 102477 | 56.8%

Length

Max length: 11
Mean length: 5.274658069
Min length: 2

Unicode category | Count | Frequency (%)
Lowercase_Letter | 26 | 50.0%
Uppercase_Letter | 26 | 50.0%

Unicode script | Count | Frequency (%)
Latin | 52 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 52 | 100.0%

Customer Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 20652
Unique (%): 11.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6691.379495
Minimum: 1
Maximum: 20757
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 1
5th percentile: 649
Q1: 3258.5
Median: 6457
Q3: 9779
95th percentile: 12383
Maximum: 20757
Range: 20756
Interquartile range (IQR): 6520.5

Descriptive statistics

Standard deviation: 4162.918106
Coefficient of variation (CV): 0.6221315215
Kurtosis: 0.01489882226
Mean: 6691.379495
Mean absolute deviation (MAD): 3447.119576
Skewness: 0.4887682515
Sum: 1207921135
Variance: 17329887.16
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e+00 1.03500e+02 1.22500e+02 1.24500e+02 1.36500e+02 ... 1.24055e+04 1.24305e+04 1.24325e+04 1.24355e+04 2.07570e+04], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
5654 | 47 | < 0.1%
5004 | 45 | < 0.1%
10591 | 45 | < 0.1%
5715 | 44 | < 0.1%
3708 | 44 | < 0.1%
9371 | 44 | < 0.1%
1443 | 43 | < 0.1%
2641 | 43 | < 0.1%
791 | 43 | < 0.1%
12284 | 43 | < 0.1%
Other values (20642) | 180078 | 99.8%

Minimum 5 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 10 | < 0.1%
3 | 18 | < 0.1%
4 | 14 | < 0.1%
5 | 7 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
20757 | 1 | < 0.1%
20756 | 1 | < 0.1%
20755 | 1 | < 0.1%
20754 | 1 | < 0.1%
20753 | 1 | < 0.1%

Customer Lname
Categorical

HIGH CARDINALITY
Distinct count: 1109
Unique (%): 0.6%
Missing: 8
Missing (%): < 0.1%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Smith | 64104 | 35.5%
Johnson | 989 | 0.5%
Brown | 909 | 0.5%
Williams | 869 | 0.5%
Jones | 859 | 0.5%
Garcia | 724 | 0.4%
Wilson | 675 | 0.4%
Taylor | 661 | 0.4%
Davis | 640 | 0.4%
Moore | 599 | 0.3%
Other values (1099) | 109482 | 60.6%

Length

Max length: 12
Mean length: 5.712290673
Min length: 2

Unicode category | Count | Frequency (%)
Lowercase_Letter | 26 | 51.0%
Uppercase_Letter | 25 | 49.0%

Unicode script | Count | Frequency (%)
Latin | 51 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 51 | 100.0%

Customer Password
Categorical

CONSTANT
REJECTED
Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
XXXXXXXXX | 180519 | 100.0%

Length

Max length: 9
Mean length: 9
Min length: 9

Unicode category | Count | Frequency (%)
Uppercase_Letter | 1 | 100.0%

Unicode script | Count | Frequency (%)
Latin | 1 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 1 | 100.0%

Customer Segment
Categorical

Distinct count: 3
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Consumer | 93504 | 51.8%
Corporate | 54789 | 30.4%
Home Office | 32226 | 17.9%

Length

Max length: 11
Mean length: 8.839064032
Min length: 8

Unicode category | Count | Frequency (%)
Lowercase_Letter | 13 | 76.5%
Uppercase_Letter | 3 | 17.6%
Space_Separator | 1 | 5.9%

Unicode script | Count | Frequency (%)
Latin | 16 | 94.1%
Common | 1 | 5.9%

Unicode block | Count | Frequency (%)
ASCII | 17 | 100.0%

Customer State
Categorical

HIGH CORRELATION
Distinct count: 46
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
PR | 69373 | 38.4%
CA | 29223 | 16.2%
NY | 11327 | 6.3%
TX | 9103 | 5.0%
IL | 7631 | 4.2%
FL | 5456 | 3.0%
OH | 4095 | 2.3%
PA | 3824 | 2.1%
MI | 3804 | 2.1%
NJ | 3191 | 1.8%
Other values (36) | 33492 | 18.6%

Length

Max length: 5
Mean length: 2.000049856
Min length: 2

Unicode category | Count | Frequency (%)
Uppercase_Letter | 24 | 77.4%
Decimal_Number | 7 | 22.6%

Unicode script | Count | Frequency (%)
Latin | 24 | 77.4%
Common | 7 | 22.6%

Unicode block | Count | Frequency (%)
ASCII | 31 | 100.0%

Customer Street
Categorical

HIGH CARDINALITY
Distinct count: 7458
Unique (%): 4.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
9126 Wishing Expressway | 122 | 0.1%
4388 Burning Goose Ridge | 117 | 0.1%
4720 Noble Hills Wynd | 116 | 0.1%
2878 Hazy Wagon Thicket | 113 | 0.1%
398 Emerald Grove | 109 | 0.1%
3593 Blue Brook Acres | 108 | 0.1%
6289 Rocky Way | 107 | 0.1%
2210 Merry Leaf Row | 107 | 0.1%
2585 Silent Autumn Landing | 105 | 0.1%
141 Dewy Plaza | 103 | 0.1%
Other values (7448) | 179412 | 99.4%

Length

Max length: 33
Mean length: 19.95711255
Min length: 8

Unicode category | Count | Frequency (%)
Lowercase_Letter | 25 | 41.0%
Uppercase_Letter | 24 | 39.3%
Decimal_Number | 10 | 16.4%
Dash_Punctuation | 1 | 1.6%
Space_Separator | 1 | 1.6%

Unicode script | Count | Frequency (%)
Latin | 49 | 80.3%
Common | 12 | 19.7%

Unicode block | Count | Frequency (%)
ASCII | 61 | 100.0%

Customer Zipcode
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 995
Unique (%): 0.6%
Missing: 3
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 35921.12691
Minimum: 603
Maximum: 99205
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB
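
A minimum of 603 and a maximum of 99205 suggest five-digit ZIP codes whose leading zeros were lost when the column was read as a number. That reading is an interpretation, not something the report states. Under that assumption, a small helper (hypothetical name `format_zip`) could restore a fixed-width string form before any further analysis:

```python
def format_zip(value):
    """Render a numeric ZIP code as a zero-padded 5-character string."""
    if value is None:
        return None  # keep missing values missing
    return str(int(value)).zfill(5)

for raw in (603, 725, 99205, None):
    print(raw, "->", format_zip(raw))
```

Treating the column as text would also make statistics such as the mean and standard deviation, which carry little meaning for postal codes, disappear from the profile.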

Quantile statistics

Minimum: 603
5th percentile: 725
Q1: 725
Median: 19380
Q3: 78207
95th percentile: 94538
Maximum: 99205
Range: 98602
Interquartile range (IQR): 77482

Descriptive statistics

Standard deviation: 37542.46112
Coefficient of variation (CV): 1.045135951
Kurtosis: -1.451419372
Mean: 35921.12691
Mean absolute deviation (MAD): 34305.8828
Skewness: 0.4908834095
Sum: 6484338146
Variance: 1409436387
Histogram with fixed size bins (bins=10)

Common values
Value | Count | Frequency (%)
725 | 66770 | 37.0%
921 | 337 | 0.2%
23455 | 334 | 0.2%
957 | 297 | 0.2%
79109 | 292 | 0.2%
33324 | 283 | 0.2%
80012 | 280 | 0.2%
33624 | 261 | 0.1%
92115 | 256 | 0.1%
92024 | 254 | 0.1%
Other values (985) | 111152 | 61.6%

Minimum 5 values
Value | Count | Frequency (%)
603 | 50 | < 0.1%
612 | 122 | 0.1%
674 | 169 | 0.1%
680 | 133 | 0.1%
685 | 126 | 0.1%

Maximum 5 values
Value | Count | Frequency (%)
99205 | 43 | < 0.1%
98632 | 67 | < 0.1%
98390 | 28 | < 0.1%
98226 | 47 | < 0.1%
98208 | 95 | 0.1%

Department Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 11
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.443460245
Minimum: 2
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 2
5th percentile: 3
Q1: 4
Median: 5
Q3: 7
95th percentile: 7
Maximum: 12
Range: 10
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 1.629246035
Coefficient of variation (CV): 0.2993033772
Kurtosis: -0.1816965071
Mean: 5.443460245
Mean absolute deviation (MAD): 1.43459705
Skewness: 0.2733206291
Sum: 982648
Variance: 2.654442643
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 3.5 4.5 5.5 6.5 ... 8.5 9.5 10.5 11.5 12. ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
7 | 66861 | 37.0%
4 | 48998 | 27.1%
5 | 33220 | 18.4%
3 | 14525 | 8.0%
6 | 9686 | 5.4%
2 | 2479 | 1.4%
9 | 2026 | 1.1%
10 | 1465 | 0.8%
11 | 492 | 0.3%
8 | 405 | 0.2%

Minimum 5 values
Value | Count | Frequency (%)
2 | 2479 | 1.4%
3 | 14525 | 8.0%
4 | 48998 | 27.1%
5 | 33220 | 18.4%
6 | 9686 | 5.4%

Maximum 5 values
Value | Count | Frequency (%)
12 | 362 | 0.2%
11 | 492 | 0.3%
10 | 1465 | 0.8%
9 | 2026 | 1.1%
8 | 405 | 0.2%

Department Name
Categorical

HIGH CORRELATION
Distinct count: 11
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Fan Shop | 66861 | 37.0%
Apparel | 48998 | 27.1%
Golf | 33220 | 18.4%
Footwear | 14525 | 8.0%
Outdoors | 9686 | 5.4%
Fitness | 2479 | 1.4%
Discs Shop | 2026 | 1.1%
Technology | 1465 | 0.8%
Pet Shop | 492 | 0.3%
Book Shop | 405 | 0.2%

Length

Max length: 18
Mean length: 7.039713271
Min length: 4

Unicode category | Count | Frequency (%)
Lowercase_Letter | 19 | 63.3%
Uppercase_Letter | 10 | 33.3%
Space_Separator | 1 | 3.3%

Unicode script | Count | Frequency (%)
Latin | 29 | 96.7%
Common | 1 | 3.3%

Unicode block | Count | Frequency (%)
ASCII | 30 | 100.0%

Latitude
Real number (ℝ)

Distinct count: 11250
Unique (%): 6.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.71995466
Minimum: -33.93755341
Maximum: 48.78193283
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: -33.93755341
5th percentile: 18.21293068
Q1: 18.26543236
Median: 33.14486313
Q3: 39.27961731
95th percentile: 42.39110184
Maximum: 48.78193283
Range: 82.71948624
Interquartile range (IQR): 21.01418495

Descriptive statistics

Standard deviation: 9.813646327
Coefficient of variation (CV): 0.3302039468
Kurtosis: -1.555414897
Mean: 29.71995466
Mean absolute deviation (MAD): 9.089862493
Skewness: -0.09796266623
Sum: 5365016.496
Variance: 96.30765423
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-33.93755341 -7.97753143 17.99465561 18.02520657 18.02536583 ... 47.91032028 47.91926957 48.3484974 48.77644539 48.78193283], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
18.2275734 | 417 | 0.2%
39.49591446 | 370 | 0.2%
18.22757721 | 300 | 0.2%
36.91083145 | 280 | 0.2%
26.0984993 | 270 | 0.1%
18.38011932 | 267 | 0.1%
33.04647064 | 234 | 0.1%
32.75856018 | 218 | 0.1%
26.21472931 | 218 | 0.1%
40.64759827 | 212 | 0.1%
Other values (11240) | 177733 | 98.5%

Minimum 5 values
Value | Count | Frequency (%)
-33.93755341 | 9 | < 0.1%
17.98249054 | 38 | < 0.1%
18.00682068 | 22 | < 0.1%
18.01836205 | 20 | < 0.1%
18.02520371 | 17 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
48.78193283 | 6 | < 0.1%
48.77095795 | 41 | < 0.1%
47.92603684 | 7 | < 0.1%
47.91250229 | 4 | < 0.1%
47.90813828 | 20 | < 0.1%

Longitude
Real number (ℝ)

HIGH CORRELATION
Distinct count: 4487
Unique (%): 2.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -84.91567469
Minimum: -158.0259857
Maximum: 115.2630768
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: -158.0259857
5th percentile: -121.435112
Q1: -98.44631195
Median: -76.84790802
Q3: -66.37058258
95th percentile: -66.18312836
Maximum: 115.2630768
Range: 273.2890625
Interquartile range (IQR): 32.07572937

Descriptive statistics

Standard deviation: 21.4332412
Coefficient of variation (CV): -0.2524061815
Kurtosis: 2.180982226
Mean: -84.91567469
Mean absolute deviation (MAD): 17.88808103
Skewness: -0.4984610726
Sum: -15328892.68
Variance: 459.3838283
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-158.0259857 -158.0131302 -158.00472255 -158.0046539 -157.9979477 ... 47.83176613 79.77704621 83.60233688 115.1564331 115.2630768 ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
-66.3706131 | 3821 | 2.1%
-66.37057495 | 3523 | 2.0%
-66.37059021 | 3522 | 2.0%
-66.37050629 | 3465 | 1.9%
-66.37055206 | 3417 | 1.9%
-66.37052918 | 3408 | 1.9%
-66.3705368 | 3377 | 1.9%
-66.37055969 | 3242 | 1.8%
-66.37060547 | 3230 | 1.8%
-66.37058258 | 3155 | 1.7%
Other values (4477) | 146359 | 81.1%

Minimum 5 values
Value | Count | Frequency (%)
-158.0259857 | 73 | < 0.1%
-158.016037 | 134 | 0.1%
-158.0102234 | 18 | < 0.1%
-158.0047607 | 7 | < 0.1%
-158.0046844 | 43 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
115.2630768 | 17 | < 0.1%
115.0497894 | 17 | < 0.1%
84.74267578 | 8 | < 0.1%
82.46199799 | 22 | < 0.1%
77.09209442 | 32 | < 0.1%

Market
Categorical

HIGH CORRELATION
Distinct count: 5
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
LATAM | 51594 | 28.6%
Europe | 50252 | 27.8%
Pacific Asia | 41260 | 22.9%
USCA | 25799 | 14.3%
Africa | 11614 | 6.4%

Length

Max length: 12
Mean length: 6.799738532
Min length: 4

Unicode category | Count | Frequency (%)
Lowercase_Letter | 10 | 50.0%
Uppercase_Letter | 9 | 45.0%
Space_Separator | 1 | 5.0%

Unicode script | Count | Frequency (%)
Latin | 19 | 95.0%
Common | 1 | 5.0%

Unicode block | Count | Frequency (%)
ASCII | 20 | 100.0%

Order City
Categorical

HIGH CARDINALITY
Distinct count: 3597
Unique (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Santo Domingo | 2211 | 1.2%
New York City | 2202 | 1.2%
Los Angeles | 1845 | 1.0%
Tegucigalpa | 1783 | 1.0%
Managua | 1682 | 0.9%
Mexico City | 1484 | 0.8%
Manila | 1381 | 0.8%
Philadelphia | 1302 | 0.7%
San Francisco | 1297 | 0.7%
London | 1187 | 0.7%
Other values (3587) | 164145 | 90.9%

Length

Max length: 35
Mean length: 8.55421313
Min length: 2

Unicode category | Count | Frequency (%)
Lowercase_Letter | 42 | 53.2%
Uppercase_Letter | 29 | 36.7%
Other_Punctuation | 3 | 3.8%
Dash_Punctuation | 1 | 1.3%
Space_Separator | 1 | 1.3%
Close_Punctuation | 1 | 1.3%
Open_Punctuation | 1 | 1.3%
Control | 1 | 1.3%

Unicode script | Count | Frequency (%)
Latin | 71 | 89.9%
Common | 8 | 10.1%

Unicode block | Count | Frequency (%)
ASCII | 59 | 100.0%

Order Country
Categorical

HIGH CARDINALITY
Distinct count: 164
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB

Common values
Value | Count | Frequency (%)
Estados Unidos | 24840 | 13.8%
Francia | 13222 | 7.3%
México | 13172 | 7.3%
Alemania | 9564 | 5.3%
Australia | 8497 | 4.7%
Brasil | 7987 | 4.4%
Reino Unido | 7302 | 4.0%
China | 5758 | 3.2%
Italia | 4989 | 2.8%
India | 4783 | 2.6%
Other values (154) | 80405 | 44.5%

Length

Max length: 31
Mean length: 8.772827237
Min length: 4

Unicode category | Count | Frequency (%)
Lowercase_Letter | 32 | 52.5%
Uppercase_Letter | 25 | 41.0%
Dash_Punctuation | 1 | 1.6%
Close_Punctuation | 1 | 1.6%
Space_Separator | 1 | 1.6%
Open_Punctuation | 1 | 1.6%

Unicode script | Count | Frequency (%)
Latin | 57 | 93.4%
Common | 4 | 6.6%

Unicode block | Count | Frequency (%)
ASCII | 54 | 100.0%

Order Customer Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 20652
Unique (%): 11.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 6691.379495
Minimum: 1
Maximum: 20757
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 1
5th percentile: 649
Q1: 3258.5
Median: 6457
Q3: 9779
95th percentile: 12383
Maximum: 20757
Range: 20756
Interquartile range (IQR): 6520.5

Descriptive statistics

Standard deviation: 4162.918106
Coefficient of variation (CV): 0.6221315215
Kurtosis: 0.01489882226
Mean: 6691.379495
Mean absolute deviation (MAD): 3447.119576
Skewness: 0.4887682515
Sum: 1207921135
Variance: 17329887.16
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e+00 1.03500e+02 1.22500e+02 1.24500e+02 1.36500e+02 ... 1.24055e+04 1.24305e+04 1.24325e+04 1.24355e+04 2.07570e+04], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
5654 | 47 | < 0.1%
5004 | 45 | < 0.1%
10591 | 45 | < 0.1%
5715 | 44 | < 0.1%
3708 | 44 | < 0.1%
9371 | 44 | < 0.1%
1443 | 43 | < 0.1%
2641 | 43 | < 0.1%
791 | 43 | < 0.1%
12284 | 43 | < 0.1%
Other values (20642) | 180078 | 99.8%

Minimum 5 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 10 | < 0.1%
3 | 18 | < 0.1%
4 | 14 | < 0.1%
5 | 7 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
20757 | 1 | < 0.1%
20756 | 1 | < 0.1%
20755 | 1 | < 0.1%
20754 | 1 | < 0.1%
20753 | 1 | < 0.1%

order date (DateOrders)
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count: 65752
Unique (%): 36.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.4 MiB
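
This variable is profiled as categorical text, which explains the high-cardinality flag: every distinct timestamp string counts as its own category. Parsing the strings into datetimes before profiling would let them be treated as dates. The sketch below assumes a month/day/year hour:minute layout without zero padding, inferred from sample values of this variable such as 9/29/2016 16:06 (29 cannot be a month).

```python
from datetime import datetime

# Assumed layout: month/day/year hour:minute, without zero padding.
ORDER_DATE_FORMAT = "%m/%d/%Y %H:%M"

# Sample values copied from this variable's frequency table.
samples = ["11/1/2016 19:01", "8/5/2015 6:42", "9/29/2016 16:06"]
parsed = [datetime.strptime(s, ORDER_DATE_FORMAT) for s in samples]

for raw, value in zip(samples, parsed):
    print(raw, "->", value.isoformat())
```

Values such as 8/5/2015 are ambiguous on their own, so the month-first reading should be confirmed against the data source before it is relied on.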

Common values
Value | Count | Frequency (%)
11/1/2016 19:01 | 5 | < 0.1%
8/5/2015 6:42 | 5 | < 0.1%
9/29/2016 16:06 | 5 | < 0.1%
9/28/2017 7:53 | 5 | < 0.1%
11/18/2016 7:37 | 5 | < 0.1%
11/19/2015 3:45 | 5 | < 0.1%
9/7/2015 9:59 | 5 | < 0.1%
4/3/2016 14:25 | 5 | < 0.1%
2/18/2015 8:20 | 5 | < 0.1%
7/17/2015 12:30 | 5 | < 0.1%
Other values (65742) | 180469 | > 99.9%

Length

Max length: 16
Mean length: 14.49898903
Min length: 13

Unicode category | Count | Frequency (%)
Decimal_Number | 10 | 76.9%
Other_Punctuation | 2 | 15.4%
Space_Separator | 1 | 7.7%

Unicode script | Count | Frequency (%)
Common | 13 | 100.0%

Unicode block | Count | Frequency (%)
ASCII | 13 | 100.0%

Order Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 65752
Unique (%): 36.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 36221.8949
Minimum: 1
Maximum: 77204
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 1
5th percentile: 3626.8
Q1: 18057
Median: 36140
Q3: 54144
95th percentile: 68596
Maximum: 77204
Range: 77203
Interquartile range (IQR): 36087

Descriptive statistics

Standard deviation: 21045.37957
Coefficient of variation (CV): 0.5810126617
Kurtosis: -1.152935673
Mean: 36221.8949
Mean absolute deviation (MAD): 18173.33239
Skewness: 0.03270879463
Sum: 6538740246
Variance: 442908001.2
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e+00 6.88835e+04 7.72040e+04], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
30397 | 5 | < 0.1%
16203 | 5 | < 0.1%
42806 | 5 | < 0.1%
49302 | 5 | < 0.1%
60278 | 5 | < 0.1%
62325 | 5 | < 0.1%
40753 | 5 | < 0.1%
47257 | 5 | < 0.1%
67308 | 5 | < 0.1%
26414 | 5 | < 0.1%
Other values (65742) | 180469 | > 99.9%

Minimum 5 values
Value | Count | Frequency (%)
1 | 1 | < 0.1%
2 | 3 | < 0.1%
4 | 4 | < 0.1%
5 | 5 | < 0.1%
7 | 3 | < 0.1%

Maximum 5 values
Value | Count | Frequency (%)
77204 | 1 | < 0.1%
77203 | 1 | < 0.1%
77202 | 1 | < 0.1%
77201 | 1 | < 0.1%
77200 | 1 | < 0.1%

Order Item Cardprod Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count: 118
Unique (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 692.5097635
Minimum: 19
Maximum: 1363
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 19
5th percentile: 191
Q1: 403
Median: 627
Q3: 1004
95th percentile: 1073
Maximum: 1363
Range: 1344
Interquartile range (IQR): 601

Descriptive statistics

Standard deviation: 336.4468073
Coefficient of variation (CV): 0.4858369153
Kurtosis: -1.267493907
Mean: 692.5097635
Mean absolute deviation (MAD): 309.2081086
Skewness: 0.1382546099
Sum: 125011170
Variance: 113196.4542
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 19. 21.5 36. 40.5 51. ... 1359.5 1360.5 1361.5 1362.5 1363. ], "bayesian blocks" binning strategy used)

Common values
Value | Count | Frequency (%)
365 | 24515 | 13.6%
403 | 22246 | 12.3%
502 | 21035 | 11.7%
1014 | 19298 | 10.7%
1004 | 17325 | 9.6%
1073 | 15500 | 8.6%
957 | 13729 | 7.6%
191 | 12169 | 6.7%
627 | 10617 | 5.9%
1362 | 838 | 0.5%
Other values (108) | 23247 | 12.9%

Minimum 5 values
Value | Count | Frequency (%)
19 | 64 | < 0.1%
24 | 74 | < 0.1%
35 | 65 | < 0.1%
37 | 262 | 0.1%
44 | 305 | 0.2%

Maximum 5 values
Value | Count | Frequency (%)
1363 | 650 | 0.4%
1362 | 838 | 0.5%
1361 | 529 | 0.3%
1360 | 357 | 0.2%
1359 | 492 | 0.3%

Order Item Discount
Real number (ℝ≥0)

ZEROS
Distinct count: 1017
Unique (%): 0.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.66474112
Minimum: 0
Maximum: 500
Zeros: 10028
Zeros (%): 5.6%
Memory size: 1.4 MiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 5.400000095
Median: 14
Q3: 29.98999977
95th percentile: 62.5
Maximum: 500
Range: 500
Interquartile range (IQR): 24.58999968

Descriptive statistics

Standard deviation: 21.80090095
Coefficient of variation (CV): 1.054980598
Kurtosis: 25.23126719
Mean: 20.66474112
Mean absolute deviation (MAD): 15.68134227
Skewness: 3.039795514
Sum: 3730378.403
Variance: 475.2792824
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 5.00000005e-02 1.05000000e-01 1.35000001e-01 2.25000001e-01 ... 2.12500000e+02 3.10000000e+02 3.67500000e+02 3.87500000e+02 5.00000000e+02], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 10028 5.6%
 
6 4589 2.5%
 
12 4067 2.3%
 
4 3647 2.0%
 
8 3626 2.0%
 
10 3424 1.9%
 
36 3268 1.8%
 
30 3230 1.8%
 
20 3123 1.7%
 
9 2964 1.6%
 
Other values (1007) 138553 76.8%
 
ValueCountFrequency (%) 
0 10028 5.6%
 
0.100000001 3 < 0.1%
 
0.109999999 15 < 0.1%
 
0.119999997 29 < 0.1%
 
0.150000006 7 < 0.1%
 
ValueCountFrequency (%) 
500 1 < 0.1%
 
400 1 < 0.1%
 
375 25 < 0.1%
 
360 1 < 0.1%
 
340 1 < 0.1%
 

Order Item Discount Rate
Real number (ℝ≥0)

ZEROS
Distinct count18
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1016681906
Minimum0
Maximum0.25
Zeros10028
Zeros (%)5.6%
Memory size1.4 MiB

Quantile statistics

Minimum0
5-th percentile0
Q10.039999999
median0.100000001
Q30.159999996
95-th percentile0.25
Maximum0.25
Range0.25
Interquartile range (IQR)0.119999997

Descriptive statistics

Standard deviation0.07041521533
Coefficient of variation (CV)0.692598294
Kurtosis-0.9011568627
Mean0.1016681906
Median Absolute Deviation (MAD)0.06074039626
Skewness0.3409276012
Sum18353.04009
Variance0.004958302549
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.005 0.065 0.155 0.175 0.19 0.225 0.25 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.159999996 10029 5.6%
 
0.07 10029 5.6%
 
0.180000007 10029 5.6%
 
0.050000001 10029 5.6%
 
0.039999999 10029 5.6%
 
0.150000006 10029 5.6%
 
0.100000001 10029 5.6%
 
0.129999995 10029 5.6%
 
0.25 10029 5.6%
 
0.090000004 10029 5.6%
 
Other values (8) 80229 44.4%
 
ValueCountFrequency (%) 
0 10028 5.6%
 
0.01 10028 5.6%
 
0.02 10028 5.6%
 
0.029999999 10029 5.6%
 
0.039999999 10029 5.6%
 
ValueCountFrequency (%) 
0.25 10029 5.6%
 
0.200000003 10029 5.6%
 
0.180000007 10029 5.6%
 
0.170000002 10029 5.6%
 
0.159999996 10029 5.6%
 

Order Item Id
Real number (ℝ≥0)

HIGH CORRELATION
UNIFORM
UNIQUE
Distinct count180519
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean90260
Minimum1
Maximum180519
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile9026.9
Q145130.5
median90260
Q3135389.5
95-th percentile171493.1
Maximum180519
Range180518
Interquartile range (IQR)90259

Descriptive statistics

Standard deviation52111.49096
Coefficient of variation (CV)0.5773486701
Kurtosis-1.2
Mean90260
Median Absolute Deviation (MAD)45129.75
Skewness8.455466102e-18
Sum1.629364494e+10
Variance2715607490
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1.00000e+00 1.80519e+05], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2047 1 < 0.1%
 
79372 1 < 0.1%
 
144748 1 < 0.1%
 
134507 1 < 0.1%
 
132458 1 < 0.1%
 
138601 1 < 0.1%
 
136552 1 < 0.1%
 
159079 1 < 0.1%
 
157030 1 < 0.1%
 
163173 1 < 0.1%
 
Other values (180509) 180509 > 99.9%
 
ValueCountFrequency (%) 
1 1 < 0.1%
 
2 1 < 0.1%
 
3 1 < 0.1%
 
4 1 < 0.1%
 
5 1 < 0.1%
 
ValueCountFrequency (%) 
180519 1 < 0.1%
 
180518 1 < 0.1%
 
180517 1 < 0.1%
 
180516 1 < 0.1%
 
180515 1 < 0.1%
 

Order Item Product Price
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count75
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean141.2325499
Minimum9.989999771
Maximum1999.98999
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum9.989999771
5-th percentile31.98999977
Q150
median59.99000168
Q3199.9900055
95-th percentile399.980011
Maximum1999.98999
Range1989.99999
Interquartile range (IQR)149.9900055

Descriptive statistics

Standard deviation139.732492
Coefficient of variation (CV)0.9893788087
Kurtosis23.31299748
Mean141.2325499
Median Absolute Deviation (MAD)102.813998
Skewness3.19101957
Sum25495158.68
Variance19525.16932
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 9.98999977 11.41499996 15.48999977 16.98999977 18.98999977 ... 566.28500365 799.9899902 1249.9949951 1749.994995 1999.98999 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
59.99000168 24820 13.7%
 
129.9900055 22372 12.4%
 
50 21035 11.7%
 
49.97999954 19298 10.7%
 
399.980011 17325 9.6%
 
199.9900055 15622 8.7%
 
299.980011 13729 7.6%
 
99.98999786 12433 6.9%
 
39.99000168 11201 6.2%
 
24.98999977 2339 1.3%
 
Other values (65) 20345 11.3%
 
ValueCountFrequency (%) 
9.989999771 285 0.2%
 
11.28999996 271 0.2%
 
11.53999996 529 0.3%
 
14.98999977 593 0.3%
 
15.98999977 602 0.3%
 
ValueCountFrequency (%) 
1999.98999 15 < 0.1%
 
1500 442 0.2%
 
999.9899902 10 < 0.1%
 
599.9899902 21 < 0.1%
 
532.5800171 484 0.3%
 

Order Item Profit Ratio
Real number (ℝ)

Distinct count162
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.1206466355
Minimum-2.75
Maximum0.5
Zeros1177
Zeros (%)0.7%
Memory size1.4 MiB

Quantile statistics

Minimum-2.75
5-th percentile-0.769999981
Q10.079999998
median0.270000011
Q30.360000014
95-th percentile0.479999989
Maximum0.5
Range3.25
Interquartile range (IQR)0.280000016

Descriptive statistics

Standard deviation0.4667956046
Coefficient of variation (CV)3.869114151
Kurtosis10.15722452
Mean0.1206466355
Median Absolute Deviation (MAD)0.2941215544
Skewness-2.893531341
Sum21779.00999
Variance0.2178981365
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-2.75 -2.72500002 -2.67500007 -2.625 -2.47500002 ... 0.465 0.47499999 0.485 0.495 0.5 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.479999989 9197 5.1%
 
0.349999994 7997 4.4%
 
0.25999999 6577 3.6%
 
0.340000004 6507 3.6%
 
0.469999999 6378 3.5%
 
0.360000014 6108 3.4%
 
0.330000013 5789 3.2%
 
0.49000001 5688 3.2%
 
0.289999992 5478 3.0%
 
0.280000001 5403 3.0%
 
Other values (152) 115397 63.9%
 
ValueCountFrequency (%) 
-2.75 72 < 0.1%
 
-2.700000048 252 0.1%
 
-2.650000095 90 < 0.1%
 
-2.599999905 181 0.1%
 
-2.549999952 234 0.1%
 
ValueCountFrequency (%) 
0.5 2529 1.4%
 
0.49000001 5688 3.2%
 
0.479999989 9197 5.1%
 
0.469999999 6378 3.5%
 
0.460000008 4822 2.7%
 

Order Item Quantity
Real number (ℝ≥0)

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.127637534
Minimum1
Maximum5
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q33
95-th percentile5
Maximum5
Range4
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.453451481
Coefficient of variation (CV)0.6831292728
Kurtosis-0.7537015772
Mean2.127637534
Median Absolute Deviation (MAD)1.267236976
Skewness0.8802518479
Sum384079
Variance2.112521209
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 1.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1 99134 54.9%
 
5 20385 11.3%
 
3 20350 11.3%
 
4 20335 11.3%
 
2 20315 11.3%
 
ValueCountFrequency (%) 
1 99134 54.9%
 
2 20315 11.3%
 
3 20350 11.3%
 
4 20335 11.3%
 
5 20385 11.3%
 
ValueCountFrequency (%) 
5 20385 11.3%
 
4 20335 11.3%
 
3 20350 11.3%
 
2 20315 11.3%
 
1 99134 54.9%
 

Sales
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count193
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean203.7720961
Minimum9.989999771
Maximum1999.98999
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum9.989999771
5-th percentile49.97999954
Q1119.9800034
median199.9199982
Q3299.9500122
95-th percentile399.980011
Maximum1999.98999
Range1989.99999
Interquartile range (IQR)179.9700088

Descriptive statistics

Standard deviation132.2730775
Coefficient of variation (CV)0.649122623
Kurtosis23.93656127
Mean203.7720961
Median Absolute Deviation (MAD)95.94152385
Skewness2.884249049
Sum36784735.01
Variance17496.16703
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 9.98999977 10.63999987 13.26499987 16.98999977 19.98499966 ... 566.28500365 799.9899902 1249.9949951 1749.994995 1999.98999 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
129.9900055 22372 12.4%
 
399.980011 17325 9.6%
 
199.9900055 15622 8.7%
 
299.980011 13729 7.6%
 
179.9700012 5016 2.8%
 
299.9500122 4988 2.8%
 
119.9800034 4968 2.8%
 
239.9600067 4955 2.7%
 
59.99000168 4893 2.7%
 
50 4432 2.5%
 
Other values (183) 82219 45.5%
 
ValueCountFrequency (%) 
9.989999771 56 < 0.1%
 
11.28999996 271 0.2%
 
11.53999996 529 0.3%
 
14.98999977 124 0.1%
 
15.98999977 118 0.1%
 
ValueCountFrequency (%) 
1999.98999 15 < 0.1%
 
1500 442 0.2%
 
999.9899902 10 < 0.1%
 
599.9899902 21 < 0.1%
 
532.5800171 484 0.3%
 

Order Item Total
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count2927
Unique (%)1.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean183.1076085
Minimum7.489999771
Maximum1939.98999
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum7.489999771
5-th percentile41.5
Q1104.3799973
median163.9900055
Q3247.3999939
95-th percentile383.980011
Maximum1939.98999
Range1932.49999
Interquartile range (IQR)143.0199966

Descriptive statistics

Standard deviation120.04367
Coefficient of variation (CV)0.6555908354
Kurtosis23.92036151
Mean183.1076085
Median Absolute Deviation (MAD)87.66090517
Skewness2.888446057
Sum33054402.38
Variance14410.48271
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 7.48999977 8.43000031 8.48000002 8.57499981 8.67499971 ... 1057.4949951 1215. 1492.494995 1549.994995 1939.98999 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
122.8399963 1264 0.7%
 
109.1900024 1247 0.7%
 
114.3899994 1243 0.7%
 
113.0899963 1243 0.7%
 
129.9900055 1243 0.7%
 
126.0899963 1243 0.7%
 
107.8899994 1243 0.7%
 
123.4899979 1243 0.7%
 
97.48999786 1243 0.7%
 
110.4899979 1243 0.7%
 
Other values (2917) 168064 93.1%
 
ValueCountFrequency (%) 
7.489999771 3 < 0.1%
 
7.989999771 3 < 0.1%
 
8.18999958 3 < 0.1%
 
8.289999962 3 < 0.1%
 
8.390000343 3 < 0.1%
 
ValueCountFrequency (%) 
1939.98999 1 < 0.1%
 
1919.98999 1 < 0.1%
 
1899.98999 1 < 0.1%
 
1889.98999 1 < 0.1%
 
1859.98999 1 < 0.1%
 

Order Profit Per Order
Real number (ℝ)

HIGH CORRELATION
Distinct count21998
Unique (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean21.97498864
Minimum-4274.97998
Maximum911.7999878
Zeros1177
Zeros (%)0.7%
Memory size1.4 MiB

Quantile statistics

Minimum-4274.97998
5-th percentile-139.2509994
Q17
median31.52000046
Q364.80000305
95-th percentile132.2899933
Maximum911.7999878
Range5186.779968
Interquartile range (IQR)57.80000305

Descriptive statistics

Standard deviation104.4335257
Coefficient of variation (CV)4.752381331
Kurtosis71.37725866
Mean21.97498864
Median Absolute Deviation (MAD)56.08869441
Skewness-4.74183407
Sum3966902.974
Variance10906.3613
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[-4274.97998 -1355.61499 -1083.169983 -824.94000245 -674.98498535 ... 239.8199997 240.19000245 245.32499695 720.9499817 911.7999878 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 1177 0.7%
 
143.9900055 199 0.1%
 
72 194 0.1%
 
46.79999924 188 0.1%
 
24 181 0.1%
 
18 175 0.1%
 
63.70000076 172 0.1%
 
62.40000153 168 0.1%
 
14.39999962 166 0.1%
 
12 166 0.1%
 
Other values (21988) 177733 98.5%
 
ValueCountFrequency (%) 
-4274.97998 1 < 0.1%
 
-3442.5 1 < 0.1%
 
-3366 1 < 0.1%
 
-3000 1 < 0.1%
 
-2592 1 < 0.1%
 
ValueCountFrequency (%) 
911.7999878 1 < 0.1%
 
864 1 < 0.1%
 
721.5999756 1 < 0.1%
 
720.2999878 1 < 0.1%
 
720 2 < 0.1%
 

Order Region
Categorical

HIGH CORRELATION
Distinct count23
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
Central America
28341
Western Europe
27109
South America
 
14935
Oceania
 
10148
Northern Europe
 
9792
Other values (18)
90194
ValueCountFrequency (%) 
Central America 28341 15.7%
 
Western Europe 27109 15.0%
 
South America 14935 8.3%
 
Oceania 10148 5.6%
 
Northern Europe 9792 5.4%
 
Southeast Asia 9539 5.3%
 
Southern Europe 9431 5.2%
 
Caribbean 8318 4.6%
 
West of USA 7993 4.4%
 
South Asia 7731 4.3%
 
Other values (13) 47182 26.1%
 

Length

Max length15
Mean length12.63430442
Min length6
ValueCountFrequency (%) 
Lowercase_Letter 17 65.4%
 
Uppercase_Letter 8 30.8%
 
Space_Separator 1 3.8%
 
ValueCountFrequency (%) 
Latin 25 96.2%
 
Common 1 3.8%
 
ValueCountFrequency (%) 
ASCII 26 100.0%
 

Order State
Categorical

HIGH CARDINALITY
Distinct count1089
Unique (%)0.6%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
Inglaterra
 
6722
California
 
4966
Isla de Francia
 
4580
Renania del Norte-Westfalia
 
3303
San Salvador
 
3055
Other values (1084)
157893
ValueCountFrequency (%) 
Inglaterra 6722 3.7%
 
California 4966 2.8%
 
Isla de Francia 4580 2.5%
 
Renania del Norte-Westfalia 3303 1.8%
 
San Salvador 3055 1.7%
 
Nueva York 2753 1.5%
 
Distrito Federal 2559 1.4%
 
Texas 2446 1.4%
 
Nueva Gales del Sur 2370 1.3%
 
Santo Domingo 2211 1.2%
 
Other values (1079) 145554 80.6%
 

Length

Max length36
Mean length10.87261729
Min length3
ValueCountFrequency (%) 
Lowercase_Letter 45 54.2%
 
Uppercase_Letter 30 36.1%
 
Control 2 2.4%
 
Other_Punctuation 2 2.4%
 
Dash_Punctuation 1 1.2%
 
Space_Separator 1 1.2%
 
Close_Punctuation 1 1.2%
 
Open_Punctuation 1 1.2%
 
ValueCountFrequency (%) 
Latin 75 90.4%
 
Common 8 9.6%
 
ValueCountFrequency (%) 
ASCII 58 100.0%
 

Order Status
Categorical

HIGH CORRELATION
Distinct count9
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
COMPLETE
59491
PENDING_PAYMENT
39832
PROCESSING
21902
PENDING
20227
CLOSED
19616
Other values (4)
19451
ValueCountFrequency (%) 
COMPLETE 59491 33.0%
 
PENDING_PAYMENT 39832 22.1%
 
PROCESSING 21902 12.1%
 
PENDING 20227 11.2%
 
CLOSED 19616 10.9%
 
ON_HOLD 9804 5.4%
 
SUSPECTED_FRAUD 4062 2.3%
 
CANCELED 3692 2.0%
 
PAYMENT_REVIEW 1893 1.0%
 

Length

Max length15
Mean length9.62396756
Min length6
ValueCountFrequency (%) 
Uppercase_Letter 20 95.2%
 
Connector_Punctuation 1 4.8%
 
ValueCountFrequency (%) 
Latin 20 95.2%
 
Common 1 4.8%
 
ValueCountFrequency (%) 
ASCII 21 100.0%
 

Order Zipcode
Real number (ℝ≥0)

MISSING
Distinct count609
Unique (%)2.5%
Missing155679
Missing (%)86.2%
Infinite0
Infinite (%)0.0%
Mean55426.13233
Minimum1040
Maximum99301
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum1040
5-th percentile10009
Q123464
median59405
Q390008
95-th percentile98026
Maximum99301
Range98261
Interquartile range (IQR)66544

Descriptive statistics

Standard deviation31919.2791
Coefficient of variation (CV)0.5758886244
Kurtosis-1.48173452
Mean55426.13233
Median Absolute Deviation (MAD)28934.40503
Skewness-0.1403113969
Sum1376785127
Variance1018840378
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
10035 648 0.4%
 
10009 550 0.3%
 
10024 541 0.3%
 
94122 526 0.3%
 
10011 463 0.3%
 
94110 407 0.2%
 
98105 406 0.2%
 
19140 386 0.2%
 
90049 368 0.2%
 
98103 366 0.2%
 
Other values (599) 20179 11.2%
 
(Missing) 155679 86.2%
 
ValueCountFrequency (%) 
1040 3 < 0.1%
 
1453 17 < 0.1%
 
1752 6 < 0.1%
 
1810 13 < 0.1%
 
1841 88 < 0.1%
 
ValueCountFrequency (%) 
99301 10 < 0.1%
 
99207 22 < 0.1%
 
98661 12 < 0.1%
 
98632 10 < 0.1%
 
98502 9 < 0.1%
 

Product Card Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count118
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean692.5097635
Minimum19
Maximum1363
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum19
5-th percentile191
Q1403
median627
Q31004
95-th percentile1073
Maximum1363
Range1344
Interquartile range (IQR)601

Descriptive statistics

Standard deviation336.4468073
Coefficient of variation (CV)0.4858369153
Kurtosis-1.267493907
Mean692.5097635
Median Absolute Deviation (MAD)309.2081086
Skewness0.1382546099
Sum125011170
Variance113196.4542
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 19. 21.5 36. 40.5 51. ... 1359.5 1360.5 1361.5 1362.5 1363. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
365 24515 13.6%
 
403 22246 12.3%
 
502 21035 11.7%
 
1014 19298 10.7%
 
1004 17325 9.6%
 
1073 15500 8.6%
 
957 13729 7.6%
 
191 12169 6.7%
 
627 10617 5.9%
 
1362 838 0.5%
 
Other values (108) 23247 12.9%
 
ValueCountFrequency (%) 
19 64 < 0.1%
 
24 74 < 0.1%
 
35 65 < 0.1%
 
37 262 0.1%
 
44 305 0.2%
 
ValueCountFrequency (%) 
1363 650 0.4%
 
1362 838 0.5%
 
1361 529 0.3%
 
1360 357 0.2%
 
1359 492 0.3%
 

Product Category Id
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count51
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.85145054
Minimum2
Maximum76
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum2
5-th percentile9
Q118
median29
Q345
95-th percentile48
Maximum76
Range74
Interquartile range (IQR)27

Descriptive statistics

Standard deviation15.64006388
Coefficient of variation (CV)0.4910314481
Kurtosis-0.6032610083
Mean31.85145054
Median Absolute Deviation (MAD)13.94913209
Skewness0.3616247994
Sum5749792
Variance244.6115983
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 2. 2.5 3.5 4.5 6.5 ... 72.5 73.5 74.5 75.5 76. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
17 24551 13.6%
 
18 22246 12.3%
 
24 21035 11.7%
 
46 19298 10.7%
 
45 17325 9.6%
 
48 15540 8.6%
 
43 13729 7.6%
 
9 12487 6.9%
 
29 10984 6.1%
 
37 2029 1.1%
 
Other values (41) 21295 11.8%
 
ValueCountFrequency (%) 
2 138 0.1%
 
3 632 0.4%
 
4 67 < 0.1%
 
5 343 0.2%
 
6 328 0.2%
 
ValueCountFrequency (%) 
76 650 0.4%
 
75 838 0.5%
 
74 529 0.3%
 
73 357 0.2%
 
72 492 0.3%
 

Product Description
Unsupported

MISSING
REJECTED
UNSUPPORTED
Missing180519
Missing (%)100.0%
Memory size1.4 MiB

Product Image
URL

Distinct count118
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
http://images.acmesports.sports/Perfect+Fitness+Perfect+Rip+Deck
24515
http://images.acmesports.sports/Nike+Men%27s+CJ+Elite+2+TD+Football+Cleat
22246
http://images.acmesports.sports/Nike+Men%27s+Dri-FIT+Victory+Golf+Polo
21035
http://images.acmesports.sports/O%27Brien+Men%27s+Neoprene+Life+Vest
19298
http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe
17325
Other values (113)
76100
ValueCountFrequency (%) 
http://images.acmesports.sports/Perfect+Fitness+Perfect+Rip+Deck 24515 13.6%
 
http://images.acmesports.sports/Nike+Men%27s+CJ+Elite+2+TD+Football+Cleat 22246 12.3%
 
http://images.acmesports.sports/Nike+Men%27s+Dri-FIT+Victory+Golf+Polo 21035 11.7%
 
http://images.acmesports.sports/O%27Brien+Men%27s+Neoprene+Life+Vest 19298 10.7%
 
http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe 17325 9.6%
 
http://images.acmesports.sports/Pelican+Sunstream+100+Kayak 15500 8.6%
 
http://images.acmesports.sports/Diamondback+Women%27s+Serene+Classic+Comfort+Bike+2014 13729 7.6%
 
http://images.acmesports.sports/Nike+Men%27s+Free+5.0%2B+Running+Shoe 12169 6.7%
 
http://images.acmesports.sports/Under+Armour+Girls%27+Toddler+Spine+Surge+Running+Shoe 10617 5.9%
 
http://images.acmesports.sports/Fighting+video+games 838 0.5%
 
Other values (108) 23247 12.9%
 
ValueCountFrequency (%) 
http 180519 100.0%
 
ValueCountFrequency (%) 
images.acmesports.sports 180519 100.0%
 
ValueCountFrequency (%) 
/Perfect+Fitness+Perfect+Rip+Deck 24515 13.6%
 
/Nike+Men%27s+CJ+Elite+2+TD+Football+Cleat 22246 12.3%
 
/Nike+Men%27s+Dri-FIT+Victory+Golf+Polo 21035 11.7%
 
/O%27Brien+Men%27s+Neoprene+Life+Vest 19298 10.7%
 
/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe 17325 9.6%
 
/Pelican+Sunstream+100+Kayak 15500 8.6%
 
/Diamondback+Women%27s+Serene+Classic+Comfort+Bike+2014 13729 7.6%
 
/Nike+Men%27s+Free+5.0%2B+Running+Shoe 12169 6.7%
 
/Under+Armour+Girls%27+Toddler+Spine+Surge+Running+Shoe 10617 5.9%
 
/Fighting+video+games 838 0.5%
 
Other values (108) 23247 12.9%
 
ValueCountFrequency (%) 
180519 100.0%
 
ValueCountFrequency (%) 
180519 100.0%
 

Product Name
Categorical

HIGH CARDINALITY
Distinct count118
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
Perfect Fitness Perfect Rip Deck
24515
Nike Men's CJ Elite 2 TD Football Cleat
22246
Nike Men's Dri-FIT Victory Golf Polo
21035
O'Brien Men's Neoprene Life Vest
19298
Field & Stream Sportsman 16 Gun Fire Safe
17325
Other values (113)
76100
ValueCountFrequency (%) 
Perfect Fitness Perfect Rip Deck 24515 13.6%
 
Nike Men's CJ Elite 2 TD Football Cleat 22246 12.3%
 
Nike Men's Dri-FIT Victory Golf Polo 21035 11.7%
 
O'Brien Men's Neoprene Life Vest 19298 10.7%
 
Field & Stream Sportsman 16 Gun Fire Safe 17325 9.6%
 
Pelican Sunstream 100 Kayak 15500 8.6%
 
Diamondback Women's Serene Classic Comfort Bi 13729 7.6%
 
Nike Men's Free 5.0+ Running Shoe 12169 6.7%
 
Under Armour Girls' Toddler Spine Surge Runni 10617 5.9%
 
Fighting video games 838 0.5%
 
Other values (108) 23247 12.9%
 

Length

Max length45
Mean length35.12000399
Min length5
ValueCountFrequency (%) 
Uppercase_Letter 25 37.9%
 
Lowercase_Letter 24 36.4%
 
Decimal_Number 10 15.2%
 
Other_Punctuation 4 6.1%
 
Math_Symbol 1 1.5%
 
Space_Separator 1 1.5%
 
Dash_Punctuation 1 1.5%
 
ValueCountFrequency (%) 
Latin 49 74.2%
 
Common 17 25.8%
 
ValueCountFrequency (%) 
ASCII 66 100.0%
 

Product Price
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count75
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean141.2325499
Minimum9.989999771
Maximum1999.98999
Zeros0
Zeros (%)0.0%
Memory size1.4 MiB

Quantile statistics

Minimum9.989999771
5-th percentile31.98999977
Q150
median59.99000168
Q3199.9900055
95-th percentile399.980011
Maximum1999.98999
Range1989.99999
Interquartile range (IQR)149.9900055

Descriptive statistics

Standard deviation139.732492
Coefficient of variation (CV)0.9893788087
Kurtosis23.31299748
Mean141.2325499
Median Absolute Deviation (MAD)102.813998
Skewness3.19101957
Sum25495158.68
Variance19525.16932
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 9.98999977 11.41499996 15.48999977 16.98999977 18.98999977 ... 566.28500365 799.9899902 1249.9949951 1749.994995 1999.98999 ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
59.99000168 24820 13.7%
 
129.9900055 22372 12.4%
 
50 21035 11.7%
 
49.97999954 19298 10.7%
 
399.980011 17325 9.6%
 
199.9900055 15622 8.7%
 
299.980011 13729 7.6%
 
99.98999786 12433 6.9%
 
39.99000168 11201 6.2%
 
24.98999977 2339 1.3%
 
Other values (65) 20345 11.3%
 
ValueCountFrequency (%) 
9.989999771 285 0.2%
 
11.28999996 271 0.2%
 
11.53999996 529 0.3%
 
14.98999977 593 0.3%
 
15.98999977 602 0.3%
 
ValueCountFrequency (%) 
1999.98999 15 < 0.1%
 
1500 442 0.2%
 
999.9899902 10 < 0.1%
 
599.9899902 21 < 0.1%
 
532.5800171 484 0.3%
 

Product Status
Boolean

CONSTANT
REJECTED
Distinct count1
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
0
180519
ValueCountFrequency (%) 
0 180519 100.0%
 

shipping date (DateOrders)
Categorical

HIGH CARDINALITY
UNIFORM
Distinct count63701
Unique (%)35.3%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
5/21/2015 15:34
 
10
10/30/2016 3:16
 
10
7/16/2015 10:14
 
10
10/8/2016 23:37
 
10
5/21/2015 10:19
 
10
Other values (63696)
180469
ValueCountFrequency (%) 
5/21/2015 15:34 10 < 0.1%
 
10/30/2016 3:16 10 < 0.1%
 
7/16/2015 10:14 10 < 0.1%
 
10/8/2016 23:37 10 < 0.1%
 
5/21/2015 10:19 10 < 0.1%
 
7/5/2017 10:59 10 < 0.1%
 
11/6/2015 23:55 10 < 0.1%
 
7/19/2016 5:41 10 < 0.1%
 
9/10/2015 20:19 10 < 0.1%
 
1/4/2017 17:11 10 < 0.1%
 
Other values (63691) 180419 99.9%
 

Length

Max length16
Mean length14.50194162
Min length13
ValueCountFrequency (%) 
Decimal_Number 10 76.9%
 
Other_Punctuation 2 15.4%
 
Space_Separator 1 7.7%
 
ValueCountFrequency (%) 
Common 13 100.0%
 
ValueCountFrequency (%) 
ASCII 13 100.0%
 

Shipping Mode
Categorical

HIGH CORRELATION
Distinct count4
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.4 MiB
Standard Class
107752
Second Class
35216
First Class
27814
Same Day
 
9737
ValueCountFrequency (%) 
Standard Class 107752 59.7%
 
Second Class 35216 19.5%
 
First Class 27814 15.4%
 
Same Day 9737 5.4%
 

Length

Max length14
Mean length12.82396867
Min length8
ValueCountFrequency (%) 
Lowercase_Letter 13 72.2%
 
Uppercase_Letter 4 22.2%
 
Space_Separator 1 5.6%
 
ValueCountFrequency (%) 
Latin 17 94.4%
 
Common 1 5.6%
 
ValueCountFrequency (%) 
ASCII 18 100.0%
 

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that, for a linear relationship, the slope of the line (its angle to the x-axis) does not affect the magnitude of r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
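
As a minimal sketch of the calculation described above (not the report's own implementation), the following computes r directly from the definition. The function name pearson_r is illustrative; the input values are the Order Item Discount and Order Item Total figures from the first five sample rows of this report. In those rows the two columns add up to a constant, so r should come out very close to -1, and rescaling one variable should leave it unchanged.

```python
from math import sqrt

def pearson_r(x, y):
    """Population covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

# Order Item Discount and Order Item Total, first five rows of the sample
discount = [13.110000, 16.389999, 18.030001, 22.940001, 29.500000]
item_total = [314.640015, 311.359985, 309.720001, 304.809998, 298.250000]

r = pearson_r(discount, item_total)

# Shifting and scaling one variable (with a positive factor) does not change r
r_rescaled = pearson_r([2 * d + 10 for d in discount], item_total)
print(r, r_rescaled)
```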

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
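
The rank-based calculation above can be sketched as follows. This is an illustrative version, assuming tied values receive the average of their ranks; the helper names average_ranks and spearman_rho and the toy data are hypothetical and not taken from the report. The example uses a cubic relationship, which is monotonic but not linear, so ρ reaches 1 while Pearson's r stays below 1.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Hypothetical data: y = x**3 is monotonic but nonlinear
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho, r)
```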

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
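
A minimal sketch of the pair-counting formula above follows. It implements the uncorrected form exactly as described (often called tau-a), in which tied pairs count as neither concordant nor discordant; library implementations commonly apply a tie correction, so their results can differ when ties are present. The function name kendall_tau and the toy data are hypothetical.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs; ties count as neither."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of this product tells whether x and y move in the same direction
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

# Hypothetical data: two of the ten pairs are out of order, (2, 3) and (4, 5)
x = [1, 2, 3, 4, 5]
y = [1, 3, 2, 5, 4]
tau = kendall_tau(x, y)  # (8 - 2) / 10
print(tau)
```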

Missing values

Sample

First rows

TypeDays for shipping (real)Days for shipment (scheduled)Benefit per orderSales per customerDelivery StatusLate_delivery_riskCategory IdCategory NameCustomer CityCustomer CountryCustomer EmailCustomer FnameCustomer IdCustomer LnameCustomer PasswordCustomer SegmentCustomer StateCustomer StreetCustomer ZipcodeDepartment IdDepartment NameLatitudeLongitudeMarketOrder CityOrder CountryOrder Customer Idorder date (DateOrders)Order IdOrder Item Cardprod IdOrder Item DiscountOrder Item Discount RateOrder Item IdOrder Item Product PriceOrder Item Profit RatioOrder Item QuantitySalesOrder Item TotalOrder Profit Per OrderOrder RegionOrder StateOrder StatusOrder ZipcodeProduct Card IdProduct Category IdProduct DescriptionProduct ImageProduct NameProduct PriceProduct Statusshipping date (DateOrders)Shipping Mode
0DEBIT3491.250000314.640015Advance shipping073Sporting GoodsCaguasPuerto RicoXXXXXXXXXCally20755HollowayXXXXXXXXXConsumerPR5365 Noble Nectar Island725.02Fitness18.251453-66.037056Pacific AsiaBekasiIndonesia207551/31/2018 22:5677202136013.1100000.04180517327.750.291327.75314.64001591.250000Southeast AsiaJava OccidentalCOMPLETENaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7502/3/2018 22:56Standard Class
1TRANSFER54-249.089996311.359985Late delivery173Sporting GoodsCaguasPuerto RicoXXXXXXXXXIrene19492LunaXXXXXXXXXConsumerPR2679 Rustic Loop725.02Fitness18.279451-66.037064Pacific AsiaBikanerIndia194921/13/2018 12:2775939136016.3899990.05179254327.75-0.801327.75311.359985-249.089996South AsiaRajastánPENDINGNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/18/2018 12:27Standard Class
2CASH44-247.779999309.720001Shipping on time073Sporting GoodsSan JoseEE. UU.XXXXXXXXXGillian19491MaldonadoXXXXXXXXXConsumerCA8510 Round Bear Gate95125.02Fitness37.292233-121.881279Pacific AsiaBikanerIndia194911/13/2018 12:0675938136018.0300010.06179253327.75-0.801327.75309.720001-247.779999South AsiaRajastánCLOSEDNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/17/2018 12:06Standard Class
3DEBIT3422.860001304.809998Advance shipping073Sporting GoodsLos AngelesEE. UU.XXXXXXXXXTana19490TateXXXXXXXXXHome OfficeCA3200 Amber Bend90027.02Fitness34.125946-118.291016Pacific AsiaTownsvilleAustralia194901/13/2018 11:4575937136022.9400010.07179252327.750.081327.75304.80999822.860001OceaniaQueenslandCOMPLETENaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/16/2018 11:45Standard Class
4PAYMENT24134.210007298.250000Advance shipping073Sporting GoodsCaguasPuerto RicoXXXXXXXXXOrli19489HendricksXXXXXXXXXCorporatePR8671 Iron Anchor Corners725.02Fitness18.253769-66.037048Pacific AsiaTownsvilleAustralia194891/13/2018 11:2475936136029.5000000.09179251327.750.451327.75298.250000134.210007OceaniaQueenslandPENDING_PAYMENTNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/15/2018 11:24Standard Class
5TRANSFER6418.580000294.980011Shipping canceled073Sporting GoodsTonawandaEE. UU.XXXXXXXXXKimberly19488FlowersXXXXXXXXXConsumerNY2122 Hazy Corner14150.02Fitness43.013969-78.879066Pacific AsiaToowoombaAustralia194881/13/2018 11:0375935136032.7799990.10179250327.750.061327.75294.98001118.580000OceaniaQueenslandCANCELEDNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/19/2018 11:03Standard Class
6DEBIT2195.180000288.420013Late delivery173Sporting GoodsCaguasPuerto RicoXXXXXXXXXConstance19487TerrellXXXXXXXXXHome OfficePR1879 Green Pine Bank725.02Fitness18.242538-66.037056Pacific AsiaGuangzhouChina194871/13/2018 10:4275934136039.3300020.12179249327.750.331327.75288.42001395.180000Eastern AsiaGuangdongCOMPLETENaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/15/2018 10:42First Class
7TRANSFER2168.430000285.140015Late delivery173Sporting GoodsMiamiEE. UU.XXXXXXXXXErica19486StevensXXXXXXXXXCorporateFL7595 Cotton Log Row33162.02Fitness25.928869-80.162872Pacific AsiaGuangzhouChina194861/13/2018 10:2175933136042.6100010.13179248327.750.241327.75285.14001568.430000Eastern AsiaGuangdongPROCESSINGNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/15/2018 10:21First Class
8CASH32133.720001278.589996Late delivery173Sporting GoodsCaguasPuerto RicoXXXXXXXXXNichole19485OlsenXXXXXXXXXCorporatePR2051 Dusty Route725.02Fitness18.233223-66.037056Pacific AsiaGuangzhouChina194851/13/2018 10:0075932136049.1600000.15179247327.750.481327.75278.589996133.720001Eastern AsiaGuangdongCLOSEDNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/16/2018 10:00Second Class
9CASH21132.149994275.309998Late delivery173Sporting GoodsSan RamonEE. UU.XXXXXXXXXOprah19484DelacruzXXXXXXXXXCorporateCA9139 Blue Blossom Court94583.02Fitness37.773991-121.966629Pacific AsiaGuangzhouChina194841/13/2018 9:3975931136052.4399990.16179246327.750.481327.75275.309998132.149994Eastern AsiaGuangdongCLOSEDNaN136073NaNhttp://images.acmesports.sports/Smart+watchSmart watch327.7501/15/2018 9:39First Class

Last rows

(index) | Type | Days for shipping (real) | Days for shipment (scheduled) | Benefit per order | Sales per customer | Delivery Status | Late_delivery_risk | Category Id | Category Name | Customer City | Customer Country | Customer Email | Customer Fname | Customer Id | Customer Lname | Customer Password | Customer Segment | Customer State | Customer Street | Customer Zipcode | Department Id | Department Name | Latitude | Longitude | Market | Order City | Order Country | Order Customer Id | order date (DateOrders) | Order Id | Order Item Cardprod Id | Order Item Discount | Order Item Discount Rate | Order Item Id | Order Item Product Price | Order Item Profit Ratio | Order Item Quantity | Sales | Order Item Total | Order Profit Per Order | Order Region | Order State | Order Status | Order Zipcode | Product Card Id | Product Category Id | Product Description | Product Image | Product Name | Product Price | Product Status | shipping date (DateOrders) | Shipping Mode
180509 | PAYMENT | 3 | 4 | 0.000000 | 335.980011 | Advance shipping | 0 | 45 | Fishing | Caguas | Puerto Rico | XXXXXXXXX | Melissa | 7 | Wilcox | XXXXXXXXX | Corporate | PR | 9453 High Concession | 725.0 | 7 | Fan Shop | 18.359095 | -66.079956 | Pacific Asia | Guangshui | China | 7 | 1/16/2016 6:49 | 26052 | 1004 | 64.0 | 0.16 | 65202 | 399.980011 | 0.00 | 1 | 399.980011 | 335.980011 | 0.000000 | Eastern Asia | Hubei | PENDING_PAYMENT | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/19/2016 6:49 | Standard Class
180510 | PAYMENT | 3 | 4 | 165.990005 | 331.980011 | Advance shipping | 0 | 45 | Fishing | Caguas | Puerto Rico | XXXXXXXXX | Melissa | 7 | Wilcox | XXXXXXXXX | Corporate | PR | 9453 High Concession | 725.0 | 7 | Fan Shop | 18.359095 | -66.079956 | Pacific Asia | Guangshui | China | 7 | 1/16/2016 6:49 | 26052 | 1004 | 68.0 | 0.17 | 65201 | 399.980011 | 0.50 | 1 | 399.980011 | 331.980011 | 165.990005 | Eastern Asia | Hubei | PENDING_PAYMENT | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/19/2016 6:49 | Standard Class
180511 | DEBIT | 2 | 2 | 157.429993 | 327.980011 | Shipping on time | 0 | 45 | Fishing | Chula Vista | EE. UU. | XXXXXXXXX | Olivia | 9314 | Smith | XXXXXXXXX | Consumer | CA | 3760 Stony Promenade | 91911.0 | 7 | Fan Shop | 32.611141 | -117.073662 | Pacific Asia | Chengdu | China | 9314 | 1/16/2016 6:28 | 26051 | 1004 | 72.0 | 0.18 | 65195 | 399.980011 | 0.48 | 1 | 399.980011 | 327.980011 | 157.429993 | Eastern Asia | Sichuan | ON_HOLD | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/18/2016 6:28 | Second Class
180512 | DEBIT | 6 | 4 | 86.400002 | 319.980011 | Late delivery | 1 | 45 | Fishing | Caguas | Puerto Rico | XXXXXXXXX | Mary | 7396 | Madden | XXXXXXXXX | Home Office | PR | 9918 Lazy Cape | 725.0 | 7 | Fan Shop | 18.245256 | -66.370621 | Pacific Asia | Chengdu | China | 7396 | 1/16/2016 6:07 | 26050 | 1004 | 80.0 | 0.20 | 65194 | 399.980011 | 0.27 | 1 | 399.980011 | 319.980011 | 86.400002 | Eastern Asia | Sichuan | COMPLETE | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/22/2016 6:07 | Standard Class
180513 | PAYMENT | 3 | 4 | 119.989998 | 299.989990 | Advance shipping | 0 | 45 | Fishing | Lancaster | EE. UU. | XXXXXXXXX | Mary | 3080 | Smith | XXXXXXXXX | Home Office | OH | 8600 Red Goose Abbey | 43130.0 | 7 | Fan Shop | 39.715977 | -82.599297 | Pacific Asia | Shanghái | China | 3080 | 1/16/2016 5:04 | 26047 | 1004 | 100.0 | 0.25 | 65185 | 399.980011 | 0.40 | 1 | 399.980011 | 299.989990 | 119.989998 | Eastern Asia | Shanghái | PENDING_PAYMENT | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/19/2016 5:04 | Standard Class
180514 | CASH | 4 | 4 | 40.000000 | 399.980011 | Shipping on time | 0 | 45 | Fishing | Brooklyn | EE. UU. | XXXXXXXXX | Maria | 1005 | Peterson | XXXXXXXXX | Home Office | NY | 1322 Broad Glade | 11207.0 | 7 | Fan Shop | 40.640930 | -73.942711 | Pacific Asia | Shanghái | China | 1005 | 1/16/2016 3:40 | 26043 | 1004 | 0.0 | 0.00 | 65177 | 399.980011 | 0.10 | 1 | 399.980011 | 399.980011 | 40.000000 | Eastern Asia | Shanghái | CLOSED | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/20/2016 3:40 | Standard Class
180515 | DEBIT | 3 | 2 | -613.770019 | 395.980011 | Late delivery | 1 | 45 | Fishing | Bakersfield | EE. UU. | XXXXXXXXX | Ronald | 9141 | Clark | XXXXXXXXX | Corporate | CA | 7330 Broad Apple Moor | 93304.0 | 7 | Fan Shop | 35.362545 | -119.018700 | Pacific Asia | Hirakata | Japón | 9141 | 1/16/2016 1:34 | 26037 | 1004 | 4.0 | 0.01 | 65161 | 399.980011 | -1.55 | 1 | 399.980011 | 395.980011 | -613.770019 | Eastern Asia | Osaka | COMPLETE | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/19/2016 1:34 | Second Class
180516 | TRANSFER | 5 | 4 | 141.110001 | 391.980011 | Late delivery | 1 | 45 | Fishing | Bristol | EE. UU. | XXXXXXXXX | John | 291 | Smith | XXXXXXXXX | Corporate | CT | 97 Burning Landing | 6010.0 | 7 | Fan Shop | 41.629959 | -72.967155 | Pacific Asia | Adelaide | Australia | 291 | 1/15/2016 21:00 | 26024 | 1004 | 8.0 | 0.02 | 65129 | 399.980011 | 0.36 | 1 | 399.980011 | 391.980011 | 141.110001 | Oceania | Australia del Sur | PENDING | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/20/2016 21:00 | Standard Class
180517 | PAYMENT | 3 | 4 | 186.229996 | 387.980011 | Advance shipping | 0 | 45 | Fishing | Caguas | Puerto Rico | XXXXXXXXX | Mary | 2813 | Smith | XXXXXXXXX | Consumer | PR | 2585 Silent Autumn Landing | 725.0 | 7 | Fan Shop | 18.213350 | -66.370575 | Pacific Asia | Adelaide | Australia | 2813 | 1/15/2016 20:18 | 26022 | 1004 | 12.0 | 0.03 | 65126 | 399.980011 | 0.48 | 1 | 399.980011 | 387.980011 | 186.229996 | Oceania | Australia del Sur | PENDING_PAYMENT | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/18/2016 20:18 | Standard Class
180518 | PAYMENT | 4 | 4 | 168.949997 | 383.980011 | Shipping on time | 0 | 45 | Fishing | Caguas | Puerto Rico | XXXXXXXXX | Andrea | 7547 | Ortega | XXXXXXXXX | Consumer | PR | 697 Little Meadow | 725.0 | 7 | Fan Shop | 18.290380 | -66.370613 | Pacific Asia | Nagercoil | India | 7547 | 1/15/2016 18:54 | 26018 | 1004 | 16.0 | 0.04 | 65113 | 399.980011 | 0.44 | 1 | 399.980011 | 383.980011 | 168.949997 | South Asia | Tamil Nadu | PENDING_PAYMENT | NaN | 1004 | 45 | NaN | http://images.acmesports.sports/Field+%26+Stream+Sportsman+16+Gun+Fire+Safe | Field & Stream Sportsman 16 Gun Fire Safe | 399.980011 | 0 | 1/19/2016 18:54 | Standard Class